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ABSTRACT 

This  paper  studies  the  message  complexity  of  decentralized  commit  protocols.  It 
shows  that  ©(WV/V1/*)  messages  are  necessary  only  if  k  rounds  of  message  interchanges 
are  allowed  It  also  shows  that  0(AflnN)  is  a  message  lower  bound  for  any  decentralized 
commit  protocol.  Finally,  a  class  of  decentralized  commit  protocols  are  proposed  which 
need  6(*AW1/*)  messages  and  use  k  rounds  of  message  interchanges.  If  we  let  k=\nN 
then  we  can  get  a  decentralised  commit  protocol  which  needs  G(AflnJV)  messages  only. 
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1  Introduction 


Decentralized  commit  protocols  are  characterized  by  successive  rounds  of  message  inter¬ 
changes.  Protocols  in  [l,  2]  which  require  a  site  to  communicate  with  every  other  site  at 
each  step  need  N(N  —  1)  and  2 N(N  -  1)  messages  for  each  round,  respectively.  N  is  the 
number  of  sites  in  the  system.  Protocols  in  [4]  need  k2NNl^k  and  2k2  N  N1^  messages  for 
blocking  and  nonblocking  protocols,  respectively.  Where  k  is  a  parameter  dependent  on  the 
number  of  rounds  of  message  exchange.  In  this  paper,  we  define  a  logical  communication 
structure  develop  a  family  of  blocking  and  nonblocking  commit  protocols.  The  protocols 
need  only  kN AT1/*  and  2kN  messages,  where  k  is  the  same  integer  parameter  of  [4]. 
This  communication  scheme  can  also  be  used  for  decentralized  consensus  protocols  [5]  with 
the  same  message  complexity. 


In  section  2,  we  describe  the  formal  model  for  our  commit  protocols.  In  section  3,  we 
describe  the  decentralised  versions  of  two-phase  blocking  and  nonblocking  protocols.  In 
section  4,  we  show  that  the  message  complexity  of  decentralised  commit  protocol.  We 
present  a  class  of  message  optimal  commit  protocols  in  section  ft.  We  also  extend  this  Idea 
to  some  decentralised  consensus  protocols  in  section  6. 
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2  Formal  Model  for  Commit  Protocols 


Transaction  execution  at  each  site  is  modeled  by  a  finite  state  automation  (FSA)  [3].  The 
network  serves  as  a  common  I/O  medium  for  all  the  FSA’s.  A  local  state  transition  at  site  i 
involves  the  reading  of  a  nonempty  string  of  messages  addressed  to  site  i,  writing  a  string  of 
messages,  and  moving  to  the  next  local  state.  State  transitions  at  one  site  are  asynchronous 
with  respect  to  transitions  at  other  sites.  The  final  states  of  the  FSA’s  are  partitioned  into 
two  sets:  the  abort  and  the  commit  states.  A  site  cannot  make  transitions  to  nonabort 
states  from  abort  states.  Similarly,  a  site  cannot  make  transitions  from  commit  states  to 
noncommit  states. 

A  local  state  is  called  commitable  if  occupancy  of  that  state  by  any  site  implies  that  all 
sites  have  agreed  to  commit  the  transaction.  All  other  states  are  said  to  be  noncommitable. 
The  global  state  of  a  distributed  transaction  can  be  represented  by  a  global  state  vector 
consisting  of  the  local  states  of  all  FSAs  and  the  outstanding  messages  in  the  network.  The 
set  of  all  global  states  reachable  from  the  initial  global  state  can  be  represented  by  a  global 
reachable  graph.  Using  this  global  reachable  graph,  it  is  possible  to  find  out  the  set  of  local 
states  that  may  concurrently  be  occupied  by  other  FSAs  when  a  FSA  at  site  i  is  in  some 
known  state  Si.  This  set  of  states  that  can  concurrently  be  occupied  by  FSAs  at  other  sites 
is  called  the  concurrency  set  of  Si. 


3  Blocking  and  Nonblocking  Commit  Protocols 

The  FSA  for  the  decentralized  two  phase  commit  protocol  is  shown  in  figure  1.  We  assume 
that  each  site  has  already  received  a  transaction  in  the  initial  state  (state  q).  If  a  site  decides 
to  commit  the  transaction,  it  sends  ’ ye s’  messages  to  all  the  other  N  —  1  sites  and  moves  to 
state  vo.  If  a  site  decides  to  abort  the  transaction  it  sends  ’no’  messages  and  moves  to  state 
a.  In  state  w,  if  ’yes’  messages  are  received  from  all  other  sites  the  transaction  is  commited 
(state  e)  otherwise  the  transaction  is  aborted,  since  the  concurrency  set  of  state  w  contains 
e  and  a.  Skeen  [2]  has  shown  that  a  protocol  is  nonblocking  if  and  only  if: 

1.  there  exists  no  local  state  whose  concurrency  set  contains  both  commit  and  abort 
states. 

2.  there  exists  no  noncommittable  state  whose  concurrency  set  contains  a  commit  state. 


Figure  1:  Blocking  Protocol 


Therefore  the  protocol  is  a  blocking  protocol.  Figure  2  illustrates  the  decentralized  non- 
blocking  commit  protocol  described  in  [2].  This  protocol  has  a  buffer  state  p  and  an  extra 
round  of  messages  to  make  it  conform  the  requirements  of  Skeen’s  nonblocking  theorem. 


Figure  2:  Nonblocking  Protocol 


4  The  Message  Complexity  of  Decentralized  Commit  Pro¬ 
tocols 

THEOREM  4.1  For  all  decentralized  commit  protocols,  if  there  are  only  k  rounds  of  message 
interchanges  allowed  then  Q(kN Nl/k)  messages  are  necessary. 

Proof:  Assume  each  round  of  message  interchange,  each  site  can  only  write  p  messages 
out.  Therefore,  after  round  1,  there  are  at  most  (p+ 1)  sites  having  the  knowledge  about  the 
local  decision  of  site  1.  After  round  2,  there  are  at  most  (p  +  l)2  sites  having  the  knowledge 
about  the  local  decision  of  site  1.  Therefore,  after  round  k,  there  are  at  most  (p  +  l)fc  sites 
having  the  knowledge' about  the  local  decision  of  site  1.  Since  only  k  rounds  of  message 
interchanges  are  allowed,  (p+  l)fc  should  be  at  least  N .  Otherwise,  these  N  sites  can  not 
enter  a  common  final  state.  Because  during  each  round,  each  site  writes  p  messages  out, 
total  messages  generated  are  Npk. 

(P+l)‘ 

(i’  +  l) 

P 

Npk 
Npk 

D 


>  N 

>  N1,k 

>  N1/k-  1  (1) 

>  kN(Nl/k  -  1) 

=  Q(kNN^k)  (2) 


Theorem  4.2  For  all  decentralized  commit  protocols,  0(iVln  JV)  messages  are  necessary. 

Proof:  All  decentralized  commit  protocols,  which  need  k  rounds  of  message  interchanges, 
require  Q(kN Nl/k)  messages.  Therefore,  minimizing  kNN 1^k  will  get  a  lower  bound  for 
all  decentralized  commit  protocols. 

d(kNN''k) 
dk 

NN'/k  +  kN{^^-)N1/k  = 


0 


(3) 


NN^k  _ 

k  =  In  AT 


min(J kNN1/k)  =  N^^NlnN 
=  cNkiN 

min  (e(kNNl/k))  =  e(N\nN) 


(4) 

(5) 


So,  ©(JVlnJV)  messages  are  necessary  for  any  decentralized  commit  protocol. 
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5  Families  of  Commit  Protocols 

In  this  section  two  families  of  decentralized  commit  protocols  are  presented.  Assume  that  N 
and  k  are  such  that  N1^  is  an  integer  (later  we  consider  the  case  when  it  is  not  an  integer). 
The  N  sites  can  be  treated  as  N  positions  of  a  k-dimensional  array  such  that  a  site  x  can 
be  numbered  by  a  k-tuple  (Xi,X2, . . .,  Xk),  where  0  <  Xi,X2,...,Xk  <  ( N 1fk  -  1)  and 

k 

V*,  0  <  *  <  N  -  1,  *  =  ^  Xj ■,*  N{k~j)/k 

3=1 

5.1  Protocol  Schema  1 

The  FSA  for  schema  1  is  shown  in  figure  3.  The  actions  in  each  state  axe  listed  below.  For 
all  site  *,  *  =  0, 1,.  ..,N  -  1,  x  can  be  numbered  as  k-tuple  (Xi,X2, .  ..,Xk). 

1.  State  q:  On  receiving  a  transaction,  unilaterally  decide  to  commit  or  abort  the  trans¬ 
action.  If  a  ’no’  message  is  received,  decide  to  abort  the  transaction.  If  an  abort 
decision  is  made,  send  ’no’  messages  and  move  to  state  aj.  If  a  commit  decision  is 
made,  send  'yes1'  messages  and  move  to  state  u»j.  All  messages  are  sent  to  sites 
numbered  as  k-tuple  (*,  X2, . . .,  Xk),  where  V  means  any  number  between  0  and 
N 1>k  -  1. 

2.  State  Wi,  1  <  i  <  (&  —  1):  If  all  'yes"  messages  are  received  from  sites  numbered 
as  k-tuple  (Xj, . . . , -Xi-i,  *,.Yi+i, . . . , Xk).  Then,  send  'yes,+1'  messages  and  move  to 
state  ti7j+i.  If  a  ’no’  message  is  received  ,  send  ’no’  messages  and  move  to  state  a,+l. 
All  messages  are  sent  to  sites  numbered  as  k-tuple  ( Xx , . . . ,  X„  Xi+2,  ...,Xk). 

3.  State  wk:  If  all  'yesk'  messages  are  received  from  sites  numbered  as  (Xi, . . .,  Xk-\,  *), 
commit  the  transaction.  If  a  ’no’  message  is  received,  abort  the  transaction. 

4.  State  c:  Commit  state. 

5.  State  a„  1  <  i  <  (k  -  1):  Send  ’no’  message  to  sites  (JYX, . . . ,  Xi,  *,  Xi+2,  ...,Xk)  and 
move  to  state  a,+i . 


6.  State  dk‘.  Abort  state. 


The  following  theorems  establish  the  properties  of  protocol  schema  1. 


Lemma  5.1  If  a  site  is  in  one  of  a  states,  it  toill  eventually  abort. 

Proof:  Since  the  only  transition  in  a  states  is  to  send  ’no’  message  and  move  to  the  next 
a  state.  Hence,  if  a  site  is  in  one  of  a  states,  it  will  eventually  move  to  state  a*  (abort 
state).  □ 


Theorem  5.1  If  a  site  decides  to  abort,  all  sites  will  eventually  abort. 

Proof:  Assume  that  site  x  decides  to  abort  in  State  q ,  where  x  can  be  numbered  as  k-tuple 
(Ai, -Xj, . .  .,-Xfc).  Then  all  sites  will  abort  after  kth  round  of  message  exchange.  We  will 
prove  this  by  induction. 

INDUCTIVE  hypothesis:  If  site  x  decides  to  abort.  After  ith  round  of  message  exchange, 

where  1  <  *  <  k,  all  sites  numbered  as  (*,...,  ..  .,Xk)  have  received  ’ no ’  message 

i—bit» 

and  will  abort.  (The  V  in  any  position  indicates  that  the  element  in  that  position  can  take 
any  value  between  0  and  N 1^k  -  1.) 


1.  Base  case:  When  i  is  equal  to  1.  Since  site  x  decides  to  abort  in  State  q.  It  sends 
’no’  messages  to  sites  which  are  numbered  as  (+,  X2, .  •  -,Xk).  Therefore,  after  the  1st 
round  of  message  exchange  all  sites  numbered  as  (*, X2,  •  •  -,Xk)  have  received  ’no’ 
messages  and  will  move  to  state  a?.  By  Lemma  4.1,  They  will  abort  eventually. 


2.  Inductive  case:  Assume  that  the  hypothesis  is  true  for  i  =  l  —  1,  where  /  <  k.  After 
(/  —  l)th  round  of  message  exchange,  all  sites  numbered  as  Xi, . . . ,  X/c)  have 

(l-l)btts 

received  ’no’  message.  Therefore,  they  will  move  to  a  states  and  send  ’no’  message  to 
all  sites  numbered  as  (*,...,  *,J0+i, ..  .,Xk)-  Such  that  after  Ith  round  of  message 

l-btts 


exchange,  all  sites  numbered  as  (*, . .  .,*, .Xj+i, . .  .,Xk)  have  received  ’no’  message. 

l—bits 


By  lemma  4.1,  They  will  abort  eventually.  So  the  inductive  hypothesis  is  true  for  all 
i,  where  1  <  i  <  k. 


Therefore,  If  a  site  x  decides  to  abort,  after  kth  round  of  message  exchange  all  sites  numbered 
as  will  abort.  So  if  a  site  decides  to  abort,  all  sites  will  eventually  abort.  □ 

k—bita 


Theorem  5.2  If  no  site  decides  to  abort  the  transaction,  all  sites  will  eventually  commit. 

Proof:  If  no  sites  aborts  the  transaction,  all  sites  will  send  'yes1'  and  move  to  state  wi. 
Since  all  messages  are  eventually  delivered,  each  site  will  receive  all  ’yes1’  messages  needed, 
then  send  ’yes3’  message  and  move  to  state  103.  Similarly,  for  all  other  ’yes’  messages. 
Finally,  all  'yesk'  messages  will  be  delivered  causing  sites  to  move  to  the  commit  state.  □ 


Theorem  5.3  Protocol  1  is  a  blocking  protocol. 

Proof:  It  is  obvious  that  the  concurrency  set  of  state  tu*  contains  both  abort  and  commit 
states.  Hence,  protocol  1  violates  the  nonblocking  conditions  states  in  section  3.  □ 


5.2  Protocol  Schema  2 

The  FSA  for  schema  2  is  shown  in  figure  4.  The  actions  in  each  state  Me  listed  below.  For 
all  site  x,  x  =  0, 1, . . . ,  N  -  1,  x  can  be  numbered  as  k-tuple  (Xi,  X2, . . . ,  X*)- 

1.  State  q:  On  receiving  a  transaction,  unilaterally  decide  to  commit  or  abort  the  trans¬ 
action.  If  a  ’no’. message  is  received,  decide  to  abort  the  transaction.  If  an  abort 
decision  is  made,  send  ’no’  messages  and  move  to  state  ai.  If  a  commit  decision  is 
made,  send  'yes1'  messages  and  move  to  state  w\.  All  messages  Me  sent  to  sites 
numbered  as  k-tuple  (*,X2, .  ..,X*),  where  V  means  any  number  between  0  and 
N^k  -  1. 

2.  State  Wi,  1  <  i  <  (Jfc  —  1)  :  If  all  'yes''  messages  Me  received  from  sites  numbered 
as  k-tuple  (Xi, . .  .,X,_i, *, X,+i, . .  .,Xk).  Then,  send  ’yes'*1’  messages  and  move  to 
state  Wi+i.  If  a  ’no’  message  is  received,  send  ’no’  messages  and  move  to  state  a,+i. 

All  messages  Me  sent  to  sites  numbered  as  k-tuple  (Xi, . . . ,  X,,  *,  Xj+2, ...» Xfc). 

11 

1 

1 


q 


prepare 


prepare 


prepare 


prepare* 


Figure  4:  Protocol  Schema  2 


3.  State  u?*:  If  ail  ’ye**’  messages  are  received  from  sites  (Alt . ..,  As_i,  •  ),  send  'prepare1' 
messages  to  sites  numbered  as  k-tuple  (*,  A3, . . . ,  A*)  and  move  to  state  p, .  If  a  ’no’ 
message  is  received,  move  to  state  a*  (abort  state). 

4.  State  pi,  1  <»<(*—  1):  T  all  ' prepare "  are  received  from  sites  numbered  as 

(Ai,..  Then,  send  'prepare'*1'  messuages  to  sites  numbered 

as  (Aj, - Xi,  *,  A,+a, . . . ,  A*)  and  move  to  state  . 

5.  State  pi,:  If  all  'prepare*'  are  received  from  sites  (Aj, .  ■  ■ ,  A*_i,  *),  commit  the  trans¬ 
action. 

6.  State  c:  Commit  state. 

7.  State  o^,  1  <  i  <  (Jfc  —  1):  Send  ’no’  messages  to  sites  which  are  numbered  as  k-tuple 
(Aj,. . ., Xi,*,  Aj+j, . . A*)  and  move  to  state  a,+i. 

8.  State  as:  Abort  state. 

The  following  theorems  establish  the  properties  of  protocol  schema  2. 

Theorem  5.4  If  a  site  decides  to  abort,  all  sites  will  eventually  abort. 

Proof:  Similar  to  proof  of  theorem  4.1. 

THEOREM  5.5  If  no  site  aborts  the  transaction,  all  sites  will  eventually  commit. 

Proof:  Similar  to  proof  of  theorem  4.2. 

Lemma  5.2  If  a  site  is  in  state  c,  all  sites  have  sent  'prepare1  ’  messages. 

Proof:  We  would  like  to  show  it  by  contradiction.  Assume  that  site  x  is  in  state  c  and 
site  y  has  not  sent  'prepare1'  yet.  Where  x  is  numbered  as  (A|, . . . ,  A*)  and  y  is  numbered 

as  (Yu...,Yk).  Since  y  has  not  sent  'prepare1',  all  sites  numbered  as  (•,Yj . V*)  have 

not  received  all  needed  ’ prepare ■*’  messages.  They  should  have  not  sent  'prepare2'  yet. 

Therefore,  all  sites  numbered  as  . Yk)  should  have  not  sent  'prepare3'  yet.  In 

general,  for  all  j,  1  <  j  <  k,  if  sites  Y}, ....  K*)  have  not  sent  ’ prepare J'  then  all 

sites  Yj+i, ..  .,Yk)  should  have  not  received  all  needed  'prepare1'  and  have  not 
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sent  ’prepare’*1’.  Therefor*,  by  induction,  all  sites  V* )  have  not  Mnt  ’prepare*’ 

yet.  So  x  should  have  not  received  ’prepare*’  from  site  (A’1,...,Jf*_i , V*).  That  means  x 
will  have  not  entered  state  c  yet.  It  is  contradictory  to  the  assumption.  Therefore,  If  a  site 
is  in  state  e,  all  sites  have  sent  ’prepare1’  messages.  □ 

TlKORBM  5.6  Protocol  t  is  a  nonblocking  protocol. 

Ptoor: 

1.  By  lemma  4.2,  if  a  sit*  is  in  commit  state  all  other  sites  have  to  be  in  one  of  the  p 
states.  The  abort  state  is  not  reachable  from  the  p  states.  Therefore,  no  local  state 
has  both  commit  and  abort  state  in  its  concurrency  set. 

2.  The  noncommi t able  states  are  state  g,  w  states  and  a  states.  Again,  by  lemma  4  2. 
when  a  site  is  in  one  of  these  states  no  other  ait*  can  be  in  commit  state.  Hence,  there 
is  no  noncommitable  state  whose  concurrency  set  contains  commit  state 

Therefore,  the  nonblocking  conditions  are  satisfied.  c 

5.3  The  Complexity  of  Messstge  Exchange 

Assume  that  N  and  k  are  such  that  .V,/k  is  an  integer  Since  the  number  of  messages 
sent  in  each  round  by  a  site  is  (.Vl/*  -  1 ).  The  total  number  of  messages  needed  by 
the  blocking  protocol  is  kN(Nl^k  -  1)  and  for  nonblocking  protocol  is  2k.V(.V,/*  -  li 
If  .V1/*  is  not  an  integer,  we  can  add  sufficient  virtual  sites  to  the  system  The  only 
difference  between  real  sites  and  virtual  sites  is  that  virtual  sites  do  not  decide  to  abort  the 
transaction  by  themself  or  the  virtual  sites  have  value  0  initially  The  number  of  virtual 
sites  needed  is  (A/  ~  S  ),  where  it  equal  to  f.V1^*')  Now  the  total  number  of  messages 

needed  would  be  hf,Vl/k]  *(  [.V  -  l )  for  blocking  protocol  and  consensus  pro'ocols  and 

2kf.V,/*1*(  [iV'/*]  -  1 )  for  the  nonblocking  protocol  Therefore,  the  order  of  romp!  exit  \  are 
all  9 (k.\  .V'/*).  If  we  ignore  the  ceiling  operator  and  consider  k  to  be  real,  the  number  ot 
message  is  minimum  for  k  =  In  .%  and  the  corresponding  message  roniplexm  i>  Hi  \  In  \ 


Obviously,  these  two  families  of  commit  protocols  are  optimal  with  respective  to  message. 
Because  we  have  shown  that  for  any  decentralised  commit  protocol,  which  has  k  rounds 
of  assesses  interchanges,  requires  ©(tJVJV1/*)  messages.  And  the  message  lower  bound  is 
9(N\»N)  for  any  decentralised  commit  protocol. 


6  Decentralized  Consensus  Protocols 


The  communication  scheme  used  in  section  5  can  be  simplified  to  apply  on  solving  the 
following  problems  efficiently.  Figure  5.  is  the  simplified  schema. 


© 

0 


0 

0 


send  own  value 

perform  computation 
send  result 

wa')  perform  computation 


perform  computation 


Figure  5:  Consensus  Protocol  Schema 

6.1  Decentralised  Finding  Maxima  and  Minima 

Given  N  distinct  values  distributed  across  the  sites  of  a  distributed  system,  such  that  each 
site  contains  one  value.  The  problem  is  to  find  the  extreme  (maximum  or  minimum)  of 
these  values  and  to  make  this  extrema  known  to  all  sites  in  the  system.  The  actions  of  each 
site  is  modeled  as  a  FSA  shown  in  figure  5.  Assume  Nx^k  is  an  integer  then  the  actions  in 
site  x,  which  has  binary  representation  (ij . n),  are  as  follows  : 

1 .  State  q:  Send  own  value  to  sites  numbered  as  ( *,  .Yj, ....  A'* )  and  move  to  state  w\ . 


2.  State  tv,-,  1  <  *  <  (k  —  1):  Upon  receiving  values  from  sites  numbered  as 

(Xi, . .  .,Xj_i,*, Xi+i, . .  .,X*),  compute  the  extrema  (maxima  or  minima)  of  these 
values  received  and  the  computed  result  of  the  previous  state.  Send  the  result  to  all 
sites  numbered  as  (Xi, X,,  *,  X,+2, X/,)  and  move  to  state  tv,-+i . 

3.  State  tv*:  Upon  receiving  values  from  all  sites  numbered  as  (Xi, . . . ,  X*_i,  *),  compute 
the  extrema  (maxima  or  minima)  of  the  values  received  and  the  computed  result  of 
previous  state.  Move  to  state  /. 

4.  State  /:  Final  state. 

The  following  theorem  establishes  correctness  of  the  protocol. 

Theorem  6.1  If  a  site  is  in  final  state  f  then  it  contains  the  extrema  (maxima  or  minima ) 
of  the  values  initially  present. 


PROOF:  Assume  the  extrema  of  the  values  initially  present  to  be  the  value  at  site  x,  say  Vx. 
Then  all  sites  will  contain  Vx  when  they  are  in  state  /.  We  will  show  this  by  induction. 


Inductive  hypothesis:  After  ith  round  of  message  exchange,  where  1  <  i  <  k,  all  sites 
numbered  as  (*, . . .,  *,Xj+j, . .  .  ,X*)  have  received  Vx. 

i-btt* 


1.  Base  case:  *  is  equal  to  1:  Site  x  has  Vx  in  state  q.  It  sends  Vx  to  sites  (*,  X2, . . . ,  X*). 
Therefore,  after  the  1st  round  of  message  exchange,  all  sites  (*,X2, . .  .,X*)  have  the 
extrema  Vx. 


2.  Inductive  case:  Assume  that  the  hypothesis  is  true  for  i  =  /  —  1,  where  l  <  k:  After  the 
(f  -  l)th  round  of  message  exchange,  all  sites  (*,...,*, Xj, ... ,  X*)  have  the  extrema 

Vx.  They  will  send  Vx  to  all  sites  (•,...,*, X/+ i,...,X*).  So  after  Ith  round  of 

I— bit* 

message  exchange,  all  sites  (•,...,*,  Xj+ j, . . ., X*)  contain  the  extrema  Vz  and  move 

I— St  It 

to  state  /.  Therefore,  the  inductive  hypothesis  is  true  for  all  t,  where  l  <  i  <  k. 


The  only  case  a  site  is  in  state  /  is  that  it  already  goes  through  k  rounds  of  message 
exchange,  therefore  it  should  contain  the  extrema.  After  At h  round  of  message  exchange 


all  sites  (■»,  cob  tain  extrema.  So,  If  n  site  is  in  state  /  it  contains  the  extrema  of  the 

k—tii* 

values  initially  present.  U 

0.2  Computation  of  Sum  Function 

Given  N  values  distributed  across  the  sites  of  a  distributed  system,  such  that  each  site 
contains  one  value,  the  problem  is  to  find  the  sum  of  these  values  and  to  make  the  sum 
known  to  all  sites  in  the  system.  As  before,  the  protocol  in  figure  5.  can  be  used.  We  can 
again  assume  N1^  is  integer  and  site  number  x  can  be  representation  as  binary  expansion 
(Jfi, . . . ,  Xk).  The  actions  in  each  state  at  a  site  x  are  explained  below. 

1.  State  q:  Send  own  value  to  sites  numbered  as  (*,  Aj, . . . ,  Xk)  and  move  to  state  ur, . 

2.  State  1  <  »  <  (k  —  1):  Upon  receiving  values  from  sites  numbered  as 

(Jf|, . . .,  Ai_i,  *,  jT|+i, . . .,  Xk),  compute  the  sum  of  these  values  received  and  the 
computed  result  of  the  previous  state.  Send  the  result  to  all  sites  numbered  as 
(Jfi, . . . ,  Xi,  *,  Xi+2, . . . ,  Xk)  and  move  to  state  u>j+j . 

3.  State  Wk:  Upon  receiving  values  from  all  sites  numbered  as  (Xx, . . . ,  Xk-i,  •),  compute 
the  sum  of  the  values  received  and  the  computed  result  of  the  previous  state.  Move 
to  state  /. 

4.  State  /:  Final  state. 

The  following  theorem  establishes  correctness  of  the  protocol. 

Theorem  6.2  If  a  site  is  in  state  f  then  it  contains  the  sum  of  the  values  initially  present. 

Proof:  We  will  show  it  by  induction.  Assume  any  site  x  has  value  V°(x)  initially. 

INDUCTIVE  HYPOTHESIS:  After  ith  round  of  message  exchange,  where  1  <  j  <  k  site  x  will 
have  value  V'{x).  Where 


1.  Bam  cam:  When  t  =  1,  after  the  1st  round  of  message  exchange,  site  x  has  received 
values  from  sites 

. .  .,Afc).  These  sites  have  values  V°(*,  Aj, . .  .,X*)  initially,  therefore,  site  x 
will  contain  sum  of  V°(*,  Xj, .  .  .,X*). 

2.  Inductive  case:  Assume  for  t  =  /  -  1,  where  l  <  k,  the  inductive  hypothesis  is  true. 
After  the  (/  -  l)th  round  of  message  exchange,  site  x  would  have  value  V,-1(x)  which 

is  sum  of  V°(»,. .  Xt, . . . , Xh)-  After  the  1th  round  of  message  exchange  site  x 
(i-t 

would  receive  values  from  sites  (Xj, . . .,  X/_t,  *,  X/+i, . . . ,  X*)  which  have  values 
Vl~1(Xi, . . . ,  X/-i,  *,  Xj+i, . . . ,  Xk),  therefore  V'(z)  would  be 

N't'-i 

V\x)  =  £  Vl~\Xx  ,...,X,.j,o,X/+1 . Xh) 

««o 

=  £  X>°(* . ♦.«,*/+» . Xk) 

=  (?) 
i-ktu 

So  the  inductive  hypothesis  is  true  for  all  i,  1  <  i  <  k. 


The  only  case  a  site  x  is  in  state  /  is  that  it  already  goes  through  k  rounds  of  message 

exchange,  therefore  it  should  contain  the  value  V'*(x).  Vk(x)  =  £  ^°(*,  is  the  sum 

fc— 6 

of  all  values  initially  present,  so  if  a  site  is  in  state  /,  it  contains  the  sum  of  all  values 
initially  present.  □ 


»  .  4  »  4 


7  Conclusions 

We  have  defined  a  communication  structure  for  decentralized  commit  protocols  which  allows 
us  to  derive  a  family  of  decentralized  commit  protocols  and  also  obtain  a  tradeoff  between 
number  of  messages  and  the  number  of  rounds  of  message  exchange.  The  protocols  are 
symmetric  and  need  only  0(Jt NNl/k)  messages  for  k  rounds  of  message  interchanges.  It 
has  been  shown  that  for  any  decentralized  commit  protocol  that  uses  k  rounds  of  message 
interchange  needs  Q(kN N1^)  messages.  Therefore,  we  have  found  a  class  of  message 
optimal  decentralized  commit  protocols.  This  communication  structure  can  be  used  to 
derived  decentralized  consensus  protocols  with  the  same  message  complexity. 
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